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Abstract 

We discuss some seemingly paradoxical yet valid effects of quantum physics in in- 
formation processing. Firstly, we argue that the act of "doing nothing" on part of an 
entangled quantum system is a highly non-trivial operation and that it is the essential 
ingredient underlying the computational speedup in the known quantum algorithms. 
Secondly, we show that the watched pot effect of quantum measurement theory gives 
the following novel computational possibility: suppose that we have a quantum com- 
puter with an on/off switch, programmed ready to solve a decision problem. Then (in 
certain circumstances) the mere fact that the computer would have given the answer if 
it were run, is enough for us to learn the answer, even though the computer is in fact 
not run. 

1 Introduction 

Many recent developments in quantum computation are motivated by existing results in 
theoretical computer science, adapted and rewritten in a quantum context. This includes 
much of the recent work on quantum error correcting codes (see for example ||, [|, [f|) 
and also the idea of using the Fourier transform to determine periodicity, which underlies 
many of the known quantum algorithms IIJ. There are relatively few results (such as Q) 
with no classical analogue, motivated intrinsically from considerations of physics. This is a 
curious situation considering that the entire subject of quantum computation derives from 
differences between the classical and quantum laws of physics. Apart from the computer 
science benefits of providing more efficient computation, an important fundamental aspect 
of the subject is the insight that it might provide for a deeper understanding of the quantum 
laws and their origins. Computer science and information theory provide an entirely new 
conceptual framework for considering this question of physics. Thus we will consider the 
question: what are the essential physical effects that give rise to the known computational 
speedups? And is it possible to use other differences between quantum and classical physics 
for novel computational possibilities? 
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2 Quantum Information Processing and Entanglement 



It is often said that the power of quantum computation derives from the superposition 
principle - the ability to do different computations in parallel in superposition, and combine 
the results with cleverly arranged interferences. But this explanation is not precise enough 
because classical waves also exhibit superposition and any effect of superposition can be 
mimicked by a classical wave system. However there is an essential difference between 
classical and quantum superposition, which lies in the different way that the two physical 
theories describe composite systems ||. 

Consider n two- level systems. In the classical case we may for example think of each sys- 
tem as comprising the two lowest energy modes of vibration of a string with fixed endpoints 
together with all superpositions. According to the laws of classical mechanics, the total 
state space of the composite system is the Cartesian product of the n subsystem spaces. 
Thus no matter how much the strings interact in their physical evolution, the total state is 
always a product state of the n separate systems. Hence we can say that the information 
needed to describe the total state grows linearly with n (being n times the information 
needed to describe a single subsystem). 

In contrast, according to the laws of quantum mechanics the total space is the tensor 
product of the subsystem spaces and a general state may be written as 



Thus generally we will have 0(2 n ) superposition components present and the information 
needed to describe the total state will grow exponentially with n. The novel quantum 
effect here - the passage from Cartesian to tensor product - is precisely the phenomenon 
of entanglement i.e. the ability to superpose general product states. 

As stated above, quantum entanglement can be readily mimicked by classical wave 
systems: instead of taking n two-level systems, we consider a single classical wave system 
with 2 n levels, allowing general superpositions of all these levels, and merely interpret these 
as entangled states via a chosen mathematical isomorphism between <8> n V2 and V^n (where 
Vfc is a fc-dimensional vector space) . However this mathematical isomorphism is not a valid 
correspondence for considerations of complexity (i.e. in which we assess the utilisation of 
physical resources): if the 2 n classical levels are, say, equally spaced energy modes, then to 
produce a general state in V^^ we will need to invest an amount of energy exponential in 
n, whereas a general state in ® n V2 will require only a linear amount of energy (as at most, 
each of the n two-level systems will need to be excited). The essential point here is that 
entanglement allows one to construct exponentially large superpositions with only linear 
physical resources and this cannot be achieved with classical superposition. 

In the sense described above the state \ip n ) can encode an exponentially large amount 
of information. This would be of little consequence if we could not process the information 
in a suitably efficient way. Fortunately the laws of quantum physics allow precisely this 
possibility, which appears to be at the heart of the computational speedup exhibited by the 
known quantum algorithms. Suppose we apply a one-qubit gate U to the first qubit of the 
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entangled state \ip n ). This would count as just one step of quantum information processing 
but to compute the result classically (say by matrix multiplication) we would calculate the 
new amplitudes by 

1 

%i— in = Uj 1 idij 2 ...j n (2) 

i=0 

where Uji is the unitary matrix for U. Now, this computation involves exponentially many 
steps: the 2x2 matrix multiplication of U needs to be performed successively 2 n_1 times for 
all possible values of the indices j'2 • • • jn- Although the action of U on qubit 1 is a physically 
simple operation, it is represented mathematically as a tensor product U (8) Ii ® • • • ® / 2 
(where I2 is the identity matrix which represents "doing nothing" on qubits 2 to n) and 
hence mathematically it becomes an exponentially large unitary operation. Thus because 
of the tensor product rule we can (somewhat enigmatically) state the principle: 

(PI): The physical act of doing nothing on part of an entangled composite system is a 
highly nontrivial operation. It leads to an exponential information processing benefit if 
used in conjunction with performing an operation on another (small) part of the system. 

Indeed it is difficult to process the quantum information by only a "small amount". Eq. 
@ illustrates that any small local operation (addressing a small part of the system) will 
generally correspond to an exponentially large processing operation from a classical point of 
view. Intuitively this reflects the denseness of the exponential quantum information stored 
within the linear resources. 

One may object to (PI), claiming that surely the information processing gain arises 
from the local operation that is actually performed (e.g. U above) rather than from the 
part that is not performed (e.g. the (n — 1) identity operations above)! To see that this is 
not the case consider our row of n qubits and suppose now that U operates on the first k 
qubits (so U is a 2 k x 2 k matrix). Let us compare the number of steps required to perform 
this transformation in the classical and quantum contexts respectively. It is known that 
any d x d unitary matrix may be programmed on a quantum computer in 0(d 2 ) steps 
|S so the quantum implementation of U will require 0((2 k ) 2 ) steps. Classically, direct 
matrix multiplication for a d x d matrix requires 0{d 2 ) steps. For U we have d = 2 k and the 
multiplication must be performed 2 n ~ k times. Thus the classical implementation will require 
0{(2 k ) 2 2 n ~ k ) steps. Hence the ratio of quantum computing effort to classical computing 
effort is 0(2 k /2 n ). This ratio decreases if either n is held fixed and k is decreased, or k 
is held fixed and n is increased. In either case we are increasing the proportion of "doing 
nothing" and this is giving rise to an increased information processing benefit. 



The Fourier transform is a fundamental ingredient M, 17, Qq] in most of the known 



quantum algorithms which exhibit a super-classical computational speedup. This includes 



the algorithms of Deutsch fiO|| , Simon 11], Shor |12| , |13|| and Grover [15]. Using the mathe- 
matical formalism of the fast Fourier transform (FFT) [ 14 ] , the unitary transformation that 
is the Fourier transform can be implemented exponentially more efficiently in a quantum 
context than in any known classical context. For example, for the group of integers 
modulo 2 n the classical fast Fourier transform algorithm runs in time 0(n2 n ) whereas its 
quantum implementation runs in time 0(n 2 ). An analysis of the implementation of the FFT 
algorithm in the quantum context, given in detail in f|, shows that the achieved exponential 
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speedup may be entirely attributed to the influence of the principle (PI). This appears to 
be an essential feature of the speedup exhibited by all known quantum algorithms. 

The full (exponentially large) amount of information embodied in the identity of a quan- 
tum state \ip n ) is termed "quantum information". The formalism of quantum mechanics 
places an extraordinary limitation on the above entanglement-related benefits of quantum 
information storage and processing: quantum measurement theory implies severe restric- 
tions on the accessibility of the quantum information in the state. For example, according 
to Holevo's theorem || we can obtain at most n bits of information about the identity of an 
unknown state \ip n ) of n qubits by any physical means whatever. This bound is the same as 
the information capacity of a classical system with the same number of levels. Thus, curi- 
ously, natural physical evolution in quantum physics corresponds to a super-fast processing 
of (quantum) information at a rate that cannot be matched by any classical means, but 
then, most of the processed information cannot be read! It is a remarkable fact that these 
two effects do not anull each other - the small amounts of information that are possible to 
obtain about the identity of the final processed state do not coincide with the particular 
meagre kinds of information processing that can be achieved by classical computation on the 
input running for a similar length of time. This disparity directly entails the computational 
speedup possibilities of quantum computation. 

3 Counterfactual Quantum Computation 

We have argued above that the information processing benefits seen in the known quantum 
algorithms all rest on some specific features of quantum entanglement. However these fea- 
tures do not exhaust all the ways in which quantum physics differs from classical physics. In 
an effort to find new quantum algorithms we might ask whether other non-classical features 
of quantum physics may be exploited for novel computational possibilities (not necessarily 
just a speedup of computation). Quantum measurement theory (c.f. the inaccessibility of 
quantum information mentioned above) provides further non-classical aspects of the quan- 
tum formalism and these are also related to controversial interpretational issues. We will 
now describe a novel computational possibility which we call "counterfactual quantum com- 
putation", based on properties of quantum measurement. 

A counterfactual effect may be defined as an observable physical effect E whose outcome 
depends on an event A that might conceivably have happened but in fact did not happen 
i.e. E is affected by the mere existence of A as a valid possible alternative even though A 
did not actually occur. Classical physics does not allow physically observable counterfactual 
effects but quantum physics does, at least in the sense described below. Their surprising and 
somewhat paradoxical occurrence in quantum mechanics has been highlighted in Penrose 
H| (see especially §§ 5.2, 5.3, 5.7, 5.8, 5.9, 5.18). 

Suppose that we have a quantum computer which has been programmed ready to solve 
a decision problem. The computer also has an on/off switch, initially set in position off. We 
will show that in certain circumstances, the mere fact that the computer would have given 
the result of the computation if it were run, is sufficient to cause a physically measureable 
effect from which we can learn the result, even though the computer is in fact not run\ Our 
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method is based on the so-called Elizur-Vaidman bomb testing problem [21] and the essential 
idea may be clarified by considering the operation of a simple Mach-Zender interferometer, 
which we discuss first. 

Consider the Mach-Zender interferometer as shown in the following diagram. 



Q 




Here HI and H2 are beam splitters and Ml and M2 are rigid perfect mirrors. The action 
of each beamsplitter is taken to be the following (written in terms of the states labelled at 
H2). For horizontal photons 

|t/)^-L(|F) + |G» (3) 

and for vertical photons 

\L) - -j=(\F) - |G» (4) 

A photon enters at \A) and is separated into a superposition ^(1-^) + \U)) of upper and 
lower paths. In the absence of the measuring instrument M the two beams coherently 
interfere at H2 and according to eqs. @ and (Q) the result is \F). Thus the photon is 
always registered in detector T and never in detector Q. 

Consider now a nondestructive measurement device A4 placed in the lower arm, which 
registers whether or not the photon passed along that arm. The initial state of A4 is \Mq) 
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and if a photon is registered it becomes an orthogonal state \M\). Following the photon we 
now have 

\A) - ±{\U) + \L)) |M ) - ±={\U) |M ) + |L) |M X )) (5) 

and the last state may be thought of as the "collapsed" mixture of \U) \Mq) or \L) \M\), 
each with probability half. Thus the interference at H2 is spoilt and we always have a 50/50 
probability of registering the photon in either T or Q. 

Suppose now that the photon is registered in Q and the measurement instrument is 
seen to be in state \Mq). (This event occurs with probability \.) Thus the photon has been 
registered absent in the lower arm and the measurement instrument, having thus apparently 
done nothing, remains in state \Mq). Yet the photon is seen at Q, which is forbidden in the 
absence of Ml Although Ai apparently does nothing, it cannot be removed, since then the 
photon can never register in Q. This is our fundamental counterfactual effect: we can say 
that the photon can be registered in Q because if the photon would have gone along the 
lower path, it would have been detected, even though it did not, in fact, go along the lower 
arm (since it was not seen by M). 

We can use this effect for computational advantage as follows. Consider an idealised 
quantum computer which is an isolated physical system containing an on/off switch, a 
set of program/data registers denoted by the state |comp) and an output register. The 
on/off switch is a two- level system with basis states |on) and |off) and the output register 
is a two-level system with basis states |0) and |1). The program/data registers are set up 
("programmed") to solve some given decision problem together with its input (e.g. it might 
be programmed to test for primality together with a given input integer.) The output 
register, initially in state |0) will be set by the computation to |0) or |1) according to the 
answer of the decision problem. The length T of the computation is a known function of 
the input. The time evolution of the computer for time T is given by 

| on) |comp) |0) — ► |on) |comp) |r) 
| off) |comp) |0) -► | off) |comp) |0) 

Here r = or 1 is the (initially unknown) result of the computation and the computation 
will run only if the switch is set to "on" . The result is written into the output register and 
all program/data registers are returned to their initial state. 

Heuristically we will relate this scenario to the interferometer as follows. M is the 
quantum computer with \Mq) and \M\) being the states |0) and |1) of the output register. 
The photon is the on /off switch and the two paths are delayed by a time T for the photon to 
eventually arrive at H2. Thus if r = the running of the computation makes no distinction 
between the paths and the photon is always seen in T. If r = 1 the computation (if it ran) 
would distinguish the two paths and we will see the photon at Q with probability \. As 
before, with probability j the photon will register at Q (so that we are sure that r = 1) and 
the output register will be seen to be in state |0). Thus the computation has not run, yet 
we have learnt the result! 

More formally in terms of states of the computer, we first set the on/off switch to the 
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superposition: 

|off) + |on) 



m i |comp) |0) (6) 
and then allow time T (the computation time) to elapse yielding the state 

— j= (| off) |comp) |0) + | on) |comp) |r)) (7) 
a/2 

Next rotate the state of the switch by 

joff) - i=(|off) + |on)) |on) - i=(|off) - |on» 

This yields the state 

1 f(|off> + |on» (|off)-|on)) , 

1 |comp) |0) H r= |comp) |r 



1 A m (|Q) + k)) , . > (|Q)-|r)) ^ 

= ^ (joff) — ^ + |on) - 7 =— j |comp) (8) 

Here r = or 1 according to the (as yet unknown) result of the computation. Next we 
measure the switch to see if it is on or off. Note that if r = then we never see "on" and if 
r = 1 we see "on" with probability 1/2. Suppose that we see "on". Then we know that the 
result of the computation must certainly be r = 1. We then examine the output register 
which will show |0) with probability 1/2. If this happens then the computation has not been 
run (because if it had, then the output register must show |1)). Overall, if the result is 
actually r = 1 then with probability 1/4 we learn the correct result (and know it is correct) 
with no computation having taken place! 

Note that if the actual solution of the decision problem is r = then we will never 
ascertain this from the above procedure because if r = then the output register will 
always show and the switch will always be finally seen to be "off" . But this outcome also 
arises for r = 1 with probability -r and we cannot a posteriori distinguish the two possible 
causes. Correspondingly, if the actual solution is r = 1 then with probability j will we fail 
to ascertain this. 

The above description of the process represented by eqs. (||) to (||) involves some delicate 
interpretational issues. For example, a many-worlds adherent might object that initially 
the switch was set in an equal superposition of being on and off, so even in the subsequent 
case of "no computation taking place" the computer actually did run in another "parallel 
universe" so we cannot claim to get the result for free. One may, to some extent, counter 
this objection as follows: suppose that when the result is really r = 1, the computer is 
also designed to explode at the end of the computation, if it is run. Then using the above 
procedure, in my world I learn that r = 1 and the computer remains unexploded, available 
to do another run. I do not really care if it self-destructs in some "other universe" ! 

The counterfactual quantum computation procedure above may be considerably im- 
proved (using a method inspired by the improvements to the Elizur-Vaidman problem given 
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in |p 21 ) to essentially eliminate the deficencies noted above. As described below, we will 
achieve the following: 
For any given e > 

(i) If the result is r = 0, we will learn this with probability 1 but some computation will 

have taken place. 

(ii) If the result is r = 1, we will learn this with probability 1 — e with no computation 

having taken place. 

Thus for the many- worlds adherent, the universe in which the computation takes place can 
be made to occur with arbitrarily small amplitude 0(y/e) (in the case that r = 1), which 
considerably weakens his/her/its objection. Recall that many basic results in information 
theory and computer science are formulated in an asymptotic framework which allows an 
arbitrarily small failure of some desired property. This occurs for example in the distinction 
between the complexity classes P and BPP |l6| (the latter allowing an arbitrarily small 
probability of a false result) and Shannon's source coding theorem having not perfect fidelity, 
but fidelity 1 — e (for any e > 0) for the signals reconstructed from their coded compressed 
versions. Thus if some undesirable result can be made to occur with arbitrarily small 



(although non-zero) probability then FAPP it may be ignored. [19] 

The improved counterfactual scheme exploits the so-called quantum watched pot effect 
(or quantum Zeno effect) and it goes as follows. We note first that the state |comp) will 
never become entangled with the other registers so we omit it, writing the action of the 
computer as 

|off)|0) - |off)|0) 

|on)|0) -> |on)|r) {> 

Choose an angle a = ^ for N sufficiently large (c.f. later). Then perform the following 
five operations: 

(a) Rotate the switch by angle a. 

(b) Allow the running time T to elapse. 

(c) Read the output register. If it shows then continue. If it shows 1 then discard the 

state and start again from the beginning. 
Remark, (a) and (b) will result in the evolution 

| off) |0) — ► (cos a | off) + sin a |on)) |0) — ► cos a |off) |0) + sin a |on) \r) (10) 

If r = then the output will always show and (c) will result in the state (cos a | off) + 
sin a | on)) |0) with probability 1. If r = 1 then (c) will result in the collapsed state 
| off) |0) obtained with (high) probability cos 2 Jfe. To complete the procedure we: 

(d) Repeat (a), (b) and (c) a further N — 1 times. 

(e) Finally measure the switch to see if it is on or off (assuming that all stages have been 

kept in (c) and (d)). 
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We claim that in (e), if the switch is seen to be "on" then r is certainly (and some 
computation has been done), and if the switch is seen to be "off", then r is certainly 1 and 
no computation has taken place. In the latter case the probability of keeping all stages is 
(cos 2 ^fif) N which tends to 1 as N —> do. Thus by choosing N to be sufficiently large we 
can make the probability of success greater than 1 — e for any given e. 

To see that our claim is correct, note that if r = then the switch is just successively 
rotated from |off) to |on) in N stages and it never entangles with the output register. If 
r = 1 then the state is repeatedly collapsed to |off) |0) so that no computation takes place 
in any stage (because if it did, the output register would show the result r = 1). Indeed the 
waiting in (b) acts as a measurement of "on" versus "off" for the switch (if r = 1) and in 
this case, we are just freezing the switch in its |off) state by frequent repeated measurement. 
This is the quantum watched pot effect. 

Note that according to (i) , if r = then this result is not learnt "for free" . A natural 
question is whether or not there is a counterfactual scheme which yields the information of 
either result (r = or 1) with no computation having taken place. The procedure described 
above may be readily modified to provide a scheme with the following properties: with 
probability 1 — e we learn the result and for either outcome, be it r = or r = 1, it is 
obtained for "free" with probability . We also learn whether or not the produced result 
was obtained for "free" . It remains an open question whether or not each of the two results 
may be obtained for "free" with high probability I — e, or indeed, whether the sum of these 
two probabilities can be made to exceed 1. 
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